Generalization of Force Control Policies from Demonstrations for Constrained Robotic Motion Tasks - A Regression-Based Approach

نویسندگان

  • Vasiliki Koropouli
  • Sandra Hirche
  • Dongheui Lee
چکیده

Although learning of control policies from demonstrations has been thoroughly investigated in the literature, generalization of policies to new contexts still remains a challenge given that existing approaches exhibit limited performance when generalizing to new tasks. In this article, we propose two policy generalization approaches employed for generalizing motion-based force control policies with the view of performing constrained motions in presence of motion-dependent external forces. The key concept of the proposed methods is using, apart from policy values, also policy derivatives or differences which express how the policy varies with respect to variations in its input and combine these two kinds of information to generalize the policy at new inputs. The first proposed approach learns policy and policy derivative values by linear regression and combines these data into a first-order Taylor-like polynomial to estimate the policy at new inputs. The second approach learns policy and policy difference data by locally weighted regression and combines them in a V. Koropouli is with the Institute of Automatic Control Engineering, Technische Universität München, Karlstr. 45, 80333 Munich, Germany Tel.: +49-89-28926885 E-mail: [email protected] S. Hirche is with the Institute for Information-Oriented Control, Technische Universität München, Barer str. 21, 80333 Munich, Germany E-mail: [email protected] D. Lee is with the Institute of Automatic Control Engineering, Technische Universität München, Karlstr. 45, 80333 Munich, Germany E-mail: [email protected] 2 Vasiliki Koropouli et al. superposition fashion to estimate the policy at new inputs. The policy differences in this approach represent variations of the policy in the direction of minimizing the distance between the new incoming and average-demonstrated inputs. The proposed approaches are evaluated in real-world robot constrained motion tasks by using a linear-actuated, two degrees-offreedom haptic device.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Constrained Generalizable Policies by Demonstration

Many practical tasks in robotic systems, such as cleaning windows, writing or grasping, are inherently constrained. Learning policies subject to constraints is a challenging problem. We propose a locally weighted constrained projection learning method (LWCPL) that first estimates the constraint and then exploits this estimate across multiple observations of the constrained motion to learn an un...

متن کامل

Manipulation Control of a Flexible Space Free Flying Robot Using Fuzzy Tuning Approach

Cooperative object manipulation control of rigid-flexible multi-body systems in space is studied in this paper. During such tasks, flexible members like solar panels may get vibrated that in turn may lead to some oscillatory disturbing forces on other subsystems, and consequently produces error in the motion of the end-effectors of the cooperative manipulating arms. Therefore, to design and dev...

متن کامل

Learning Dexterous Manipulation for a Soft Robotic Hand from Human Demonstration

Dexterous multi-fingered hands can accomplish fine manipulation behaviors that are infeasible with simple robotic grippers. However, sophisticated multi-fingered hands are often expensive and fragile. Low-cost soft hands offer an appealing alternative to more conventional devices, but present considerable challenges in sensing and actuation, making them difficult to apply to more complex manipu...

متن کامل

Trajectory Optimization of Cable Parallel Manipulators in Point-to-Point Motion

Planning robot trajectory is a complex task that plays a significant role in design and application of robots in task space. The problem is formulated as a trajectory optimization problem which is fundamentally a constrained nonlinear optimization problem. Open-loop optimal control method is proposed as an approach for trajectory optimization of cable parallel manipulator for a given two-end-po...

متن کامل

Learning Deep Policies for Physics-Based Manipulation in Clutter

Uncertainty in modeling real world physics makestransferring traditional open-loop motion planning techniquesfrom simulation to the real world particularly challenging.Available closed-loop policy learning approaches, for physics-based manipulation tasks, typically either focus on single objectmanipulation, or rely on imitation learning, which inherentlyconstrains task g...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Intelligent and Robotic Systems

دوره 80  شماره 

صفحات  -

تاریخ انتشار 2015